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TO:  SEE  DISTRIBUTION 


1,  Attached  are  executive  summaries  of  four  manpower  computer  models 
developed  by  the  Army  during  a  three  year  study  effort  under  contract 
to  General  Electric  Co  (TEMPO).  The  study  was  initiated  in  June  1972 
with  the  objective  of  improving  the  Army's  capability,  given  alternative 
situations  and  policies,  to  predict  the  active  duty  strength  in  terms  of 
officers  and  enlisted  personnel  at  specified  dates  in  the  future.  The 
study  was  completed  on  1  July  1975. 

2.  Overview  of  the  models  and  proponent  offices  at  Department  of  the 
Army  for  each  are  as  follows: 

a.  Automatic  Interaction  Detector  -  Enlisted  (AID-E)  a  loss  rate 
generator  that  employs  individual  attributes  (e.g.,  time  left  in  service, 
educational  level,  mental  group,  etc.)  as  a  basis  to  predict  loss 
behavior.  The  AID  technique, developed  by  the  University  of  Michigan,  is 
a  powerful  statistical  routine  which  appears  to  have  wide  applicability 

in  evaluating  the  causes  for  many  types  of  human  behavior.  Proponent  office 
at  Department  of  the  Army  is  the  Manpower  Programs  Division,  Directorate  of 
Plans,  Programs  and  Budget,  Deputy  Chief  of  Staff  for  Personnel  (DCSPER)  — 
Mr.  Petruzzi,  OX  5-3857.  J  \j  c  ( 

b.  Automatic  Interaction  Detector  -  Officer  (AID-0)  a  loss  rate 

generator  for  officers  comparable  to  AID-E.  Proponent  office  is  Officer 
Division,  Directorate  of  Military  Personnel  Management,  DCSPER  --  MAJ 
Fishback,  OX  7-6873.  yc,/; 

I 

c.  Central  Integrating  Model  -  Enlisted  (CIM-E)  a  force  projection 
model  capable  of  simulating  the  normal  personnel  life  cycle  (procurement, 
reenlistment,  promotion,  separation)  of  the  enlisted  population  under 
varying  sets  of  policies  and  situations.  Force  can  be  disaggregated  by 
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career  management  field,  sex,  race,  educational  level  and  mental  group. 
Proponent  office  is  Enlisted  Division,  Directorate  of  Military  Personnel 
Management,  DCSPER  --  CPT  Rose,  OX  5-9419.  3  Yo!$  __ 

d.  Central  Integrating  Model  -  Officer  (CIM-O)  an  officer  force  pro¬ 
jection  model  similar  to  CIM-E  except  that  CIM-0  employs  a  modeling 
technique  which  permits  many  more  types  of  disaggregations  than  the  enlisted 
model.  Proponent  office  is  Officer  Division,  Directorate  of  Military 
Personnel  Management,  DCSPER  —  MAJ  Fishback,  OX  7-6873.  7  \jtfC 

3.  Further  information  on  the  models  can  be  gained  by  contacting  the 
proponents  listed  above. 

FOR  THE  DEPUTY  CHIEF  OF  STAFF  FOR  PERSONNEL: 
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Lt  Colonel,  GS 
Contracting  Officer’s 
Technical  Representative 
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FOREWORD 


GE-TEMPO,  after  discussions  with  the  Secretary  of  the  Army 
and  the  Deputy  Chief  of  Staff  for  Personnel,  initiated  a  two-year 
project  known  as  the  Army  Manpower  Prediction  System  (AMPS) 
on  June  30,  1972.  Its  purpose  was  to  assist  the  Army  in  making 
more  useful  and  accurate  projections  of  manpower  strength  and 
composition.  After  six  months  of  study,  TEMPO  submitted  to 
the  Army  a  Phase  I  report  outlining  areas  in  which  the  Army  can 
significantly  improve  its  manpower  projections  and  recommending 
approaches  to  making  such  improvements. 

The  Army  adopted  a  majority  of  these  recommendations,  and 
asked  TEMPO  to  develop  the  following  models  in  Phase  II:  the 
AID  Doss  Probability  System  for  Enlisted  (AID-E),  the  AID 
I, oss  Probability  System  for  Officers  (AID-E),  the  Central 
Integrating  Model  for  Enlisted  (CIM-E),  and  the  Central  Inte¬ 
grating  Model  for  Officers  (CIM-O). 

The  work  on  the  AID-E  was  carried  out  between  March  and 
November  of  1973. 

This  report  in  three  volumes  (1)  describes  TEMPO'S  analytical 
approach  to  development  of  the  AID-E.  (2)  describes  the  AID-E 
itself,  and  (3)  outlines  procedures  for  operating  the  system. 
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ABSTRACT 

The  Department  of  the  Army  has  many  manpower  projection  models, 
each  of  which  serves  different  purposes  for  different  managers.  Since 
the  objectives  of  these  managers  are  interrelated,  it  would  be  useful 
to  the  Army  if  t lie  loss  rates  used  in  these  models  were  not  only 
reliable  but  also  consistent,  so  that  the  projections  of  one  model  could 
easily  be  compared  with  the  projections  of  others,  in  response  to  this 
need,  GE- TEMPO  has  devised  a  system  capable  of  generating  several 
sets  of  loss  rates  from  one  basic  set  of  loss  rates.  The  generated 
rates  can  be  used  in  both  presently  existing  and  future  generations  of 
manpower  projection  models  with  consistency  and  reliability.  This 
system  is  called  the  AID  Loss  Probability  System  for  Enlisted.  The 
core  of  the  system  is  a  statistical  technique  which  groups  the  actual 
enlisted  population  into  several  subpopulations  according  to  personal 
characteristics  most  predictive  of  losses  of  individuals  in  each  sub¬ 
population.  A  loss  probability  is  developed  for  each  individual  who 
belongs  to  a  specific  subpopulation  and  any  desired  loss  format  (e.  g.  , 
grade  by  years  of  service)  can  be  generated  by  summing  over  the 
appropriate  individual  characteristics.  Since  each  individual's  personal 
characteristics  and  his  loss  probability  have  a  one-to-one  correspondence, 
no  matter  how  individuals  are  grouped  for  the  purpose  of  generating  loss 
rates  (e.  g.  ,  by  grade,  MOS,  or  any  other  relevant  characteristic) 
each  individual's  characteristics  and  his  loss  probability  are  preserved. 
The  loss  probability  of  the  group,  however  defined,  is  always  the  sum 
of  the  loss  probabilities  of  actual  individuals. ^ It  is  apparent  that  this 
consistency  renders  flexibility  for  general  ing  ;r  variety  of  loss  formats 
for  all  conceivable  models.  Eurthermore,  any  historical  probabilities 
can  be  easily  adjusted  to  reflect  current  or  proposed  changes  in  man¬ 
power  policy.  This  report  describes  the  All)  Loss  Probability  System, 
shows  sample  predictions  developed  from  historical  data,  compares 
these  predictions  to  actual  loss  experience,  describes  how  the  results 
of  the  system  can  be  applied  in  other  models,  and  explains  how  to  use 
the  system.  The  results  of  this  study  show  that  the  system  is  ar 
excellent  method  for  predicting  loss  rates:  its  margin  of  error  for 
each  enlistment  category  is  less  than  1.  5"»  in  K \  T.i  and  less  than  .'L  fi"« 
in  EY70.  Finally,  the  system  is  so  versitile  that  it  can  be  used  for 
purposes  other  than  predicting  losses.  For  example,  n  can  generate 
various  reports  such  as  distribution  of  age-group.  ,  m.irital  status, 
MOS-imbalances,  and  other  information  contained  in  the  KMF  and 
Gain/ Loss  Transaction  File.  Also  the  system  can  tabulate  and 
analyze  information  gathered  by  DOT) /Army  surveys. 
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Chapter  1 


INTRODUCTION 

GE-TEMPO,  after  discussions  with  the  Secretary  of  the  Army 
and  the  Deputy  Chief  of  Staff  for  Personnel,  initiated  a  two-year 
project  known  as  the  Army  Manpower  Prediction  System  (AMPS) 
on  June  30,  1072.  Its  purpose  was  to  assist  the  Army  in  making 
more  useful  and  accurate  projections  of  manpower  strength  and 
composition.  After  six  months  of  study,  TEMPO  submitted  to  the 
Army  a  Phase  I  report  outlining  areas  in  which  the  Army  can 
significantly  improve  its  manpower  projections  and  recommending 
approaches  to  making  such  improvements. 

The  Army  adopted  a  majority  of  these  recommendations,  and 
asked  TEMPO  to  develop  the  following  models  in  Phase  II:  the  AID 
I.oss  Pro  ability  System  for  Enlisted  (AID-E),  the  AID  Loss  Proba¬ 
bility  System  for  Officers  (AID-O),  the  Central  Integrating  Model 
for  Enlisted  (CIM-E),  and  the  Central  Integrating  Model  for  Officers 
(CIM-O). 

The  work  on  the  AID-E  was  carried  out  between  March  and 
November  of  1973. 

This  report.  (1)  describes  TEMPO'S  analytical  approach  to 
development  of  the  AID-E,  (2)  describes  the  AID-E  itself,  and  (3) 
outlines  procedures  for  operating  it. 

1.  1  Purpose  of  the  AID-E 

The  purpose  of  the  AID-E  is  to  provide  reliable  and  consistent 
loss  rates  (including  reenlistmcnt  rates)  for  existing  and  future 
generations  of  manpower  projection  models.  Use  of  these  rates  would 
integrate  the  Army's  presently  decentralized  manpower  projections. 

1.2  Scope  of  the  AID-E 

The  AID-E  can  generate  loss  rates  (including  reenlistment  rules) 
of  active  enlisted  personnel,  both  men  and  women,  by  enlistment 
category,  fry  cause  of  loss,  and  by  other  personal  characteristics. 

To  illustrate  the  flexibility  of  the  AID-E,  the  rales  generated  for 
inclusion  in  this  report  are  in  formats  required  by  the  CIM-E,  J’lA, 
and  MAIL  models.  It  should  be  kept  in  mind,  however,  that  AID-E 
is  capable  of  generating  loss  rates  in  virtually  any  format  which  may 
be  needed  by  Army  manpower  planners. 
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The  system  does  not  include  warrant  officers,  officers,  reservists, 
or  civilians  serving  the  Army.  Loss  rates  for  officers  will  he 
generated  by  the  AID-O  model  which  will  be  completed  by  August  1974. 

1.  3  Organization  of  the  Report 

This  final  report  on  the  AID  Loss  Probability  System  for 
Enlisted  (AID-E)  consists  of  three  volumes  with  the  following  titles: 

Volume  I  Executive  Summary 

Volume  II  Manual  for  Planner/User 

Volume  III  Manual  for  Programmer/Systems  Analyst 

Volume  I,  the  Executive  Summary,  contains  an  overview  and 
brief  description  of  what  the  model  is  and  what  it  does.  The  overview 
provides  an  understanding  of  the  capabilities,  limitations  and  assump¬ 
tions  of  the  AID- El;  description  of  the  system  includes  resource 
requirements  and  examples  of  the  system's  output. 

Volume  II,  the  Manual  for  Planner/ User,  is  intended  for  action 
officers  whose  responsibility  is  to  analyze  manpower  plans  and  programs 
and  to  make  projections.  It  contains  a  detailed  description  of  the 
system  and  the  system  logic.  Detailed  descriptions  of  input  require¬ 
ments  are  provided  along  with  discussions  of  operating  considerations 
and  of  proper  interpretation  of  results.  This  volume  provides  the 
user  with  the  information  necessary  to  set-up  input  for  computer  runs 
and  discusses  various  analytical  approaches. 

Volume  III,  the  Manual  for  Programmer/System  Analyst,  contains 
the  minute  details  of  the  AID-E  system.  The  details  of  each  computer 
program  in  the  system  are  provided  to  facilitate  modification  of  the 
system  if,  at  some  future  date,  modification  becomes  necessary. 

A  table  of  contents  covering  all  three  volumes  has  been  included 
in  each  volume  so  that  readerr  may  locate  sections  most  appropriate 
to  their  interests. 

1.4  Participation  of  the  Army 

The  Army  assigned  Col.  Warren  A.  Spaulding,  Lt.  Col.  Norman 
C5.  Hlahuta,  and  Maj.  .lohn  E.  Pearson,  .1  r.  to  Phase  1  of  the  AMI'S 
project  on  a  full-time  basis.  Many  other  officers  from  the  ODCSJ’ER, 
notably  Captain  Roger  W.  Collins,  gave  very  generously  of  their  time. 


2 


t 


1 


During  the  development  of  the  AID-E  in  Phase  II,  Lt.  Col,  Joseph 
V.  Rafferty,  Lt.  Col.  Robert  Lewis,  Maj.  John  E.  Pearson,  Capt. 

Roger  W.  Collins,  and  Lt.  Richard  Gudewicz  were  most  helpful.  Many 
officers  and  civilians  in  the  ODCSPER,  MILPERCEN,  and  USAMSSA 
also  gave  their  assistance. 

TEMPO  is  grateful  for  office  space  made  available  in  the  Pentagon 
to  TEMPO  staff  throughout  the  project. 

1.  5  TEMPO  Staff  and  Consultants 

1.  5.  1  Phase  I 

The  Phase  I  effort  of  the  AMPS  was  assisted  by  Dr.  Kwan  H.  Kim, 

G.  E.  Consultant.  The  TEMPO  team,  under  the  direction  of  Dr.  Steve 
Enke  and  Mr.  Terry  L.  Schilling,  has  included  Dr.  William  E.  McFarland, 
Jr.,  Mr.  Richard  A.  Drown,  Mr.  Stephen  Chabon,  Mr.  Bryan  D.  Hickman, 
Mr.  James  L.  Johnston,  and  Miss  Cynthia  Bennett.  TEMPO  wishes  to 
acknowledge  assistance  received  from  the  Honorable  Iladlai  A.  Hull, 
Assistant  Secretary  of  the  Army  (Manpower  and  Reserve  Affairs); 

Lieutenant  General  Walter  T.  Kerwin,  Jr.  ,  former  Deputy  Chief  of 
Staff  for  Personnel;  General  Paul  D.  Phillips  (Ret.  ),  Deputy  Assistant 
Secretary  of  the  Army  (Manpower  and  Reserve  Affairs);  and  many 
others. 

1.5.2  Phase  II:  The  AID-E 

The  AID-E  was  developed  by  Ken  R.  Powell  and  Terry  L.  Schilling 
of  GE-TEMPO,  with  the  advice  and  assistance  of  Dr.  Kwan  II.  Kim, 
consultant.  Nancy  Karp  of  GE-TEMPO  was  helpful  in  programming  the 
first  version  of  the  data  edit  program. 
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1.  6  AID-E  as  a  Means  to  Integrate  Projection  Models 
1.  6.  1  Through  Consistency 

The  process  of  manpower  planning  and  programming  in  the 
Army  is  organized  into  several  directorates  and  divisions  along  many 
functional  lines  (usually  interrelated  in  ultimate  purpose).  As  a 
result,  a  high  degree  of  decentralization  has  resulted  and  a  variety 
of  projection  models  have  been  developed.  These  models  project 
enlisted  strengths  in  a  number  of  forms  and  the  projected  strengths 
usually  differ  from  one  another  and  lack  compatibility.  The  dispar¬ 
ities  could  be  reduced  if  all  models  were  to  use  the  same  consistent 
input  parameters;  a  key  parameter,  of  course,  is  loss  rate.  Since 
the  AID-E  is  capable  of  generating  loss  rates  in  a  form  suitable  for 
use  in  any  model  from  a  basic  set  of  loss  rates  (i.  e.  "AID  tree"), 
the  use  of  the  rates  from  AID-E  would  make  the  loss  projections  of 
all  models  consistent. 

1.  6.  2  Through  Accuracy 

Since  the  need  for  new  accessions  is  largely  determined  by 
losses  incurred  to  existing  strength,  the  use  of  accurate  loss  rates  is 
fundamental  to  a  credible  manpower  projection.  Estimates  of  loss 
rates  of  Army  personnel  are  developed  in  many  forms  from  several 
sources;  often  they  are  expressed  in  very  gross  terms  without  regard 
to  individual  characteristics  which  are  prone  to  both  loss  behavior  and 
cause  of  loss.  These  rates  frequently  compete  with  one  another  in  the 
policy-making  process;  it  is  impossible  to  validate  which  model  can 
be  trusted  most.  The  preliminary  results  of  this  study  show  that 
the  AID-E  reliably  produces  loss  rates  of  very  great  accuracy, 
varying  from  actual  loss  experience  by  only  a  small  percentage.  The 
Army's  use  of  sucli  accurate,  validated  rates  would  reduce  the  pre¬ 
sent  state  of  confusion  and  integrate  various  projection  models  with¬ 
out  loss  of  the  advantages  of  the  present  decentralized  manpower 
management. 

1.  7  Assumption 

The  AID  analysis  is  based  upon  a  set  of  predictor  variables  re¬ 
presenting  personal  and  circumstantial  characteristics  contained  in 
the  EME  at  a  specific  point  in  time.  It  does  not  consider  any  other 
variables.  Thus  states  of  war  or  peace,  the  unemployment  rate  in 
civilian  industries,  and  enlisted  men's  attitudes  toward  the  Army  are 
not  accounted  for  in  the  AID  analysis.  If  the  values  of  these  and  other 
potentially  important  variables  have  been  stable  over  several  years  and 
are  expected  to  be  so  in  the  immediate  future,  there  is  no  need  to 
weigh  their  impact  on  the  AID  analysis.  However,  if  the  values  of  such 
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ommitted  variables  change  frequently  and  become  influencing  factors 
on  loss  behavior  in  the  future  (or  are  expected  to  become  so),  predic¬ 
tions  of  losses  based  upon  static  EMF  data  alone  may  miss  a  shift 
in  loss  behavior  over  time. 

One  method  of  incorporating  the  variables  representing  war/peace, 
unemployment  rate,  etc.  ,  in  the  AID  analysis  is  to  add  the  values  of 
these  variables  to  the  transaction  data.  More  specifically,  when¬ 
ever  a  transaction  occurs  to  a  person  at  a  certain  date,  the  values 
of  these  variables  as  of  that  data  can  be  added  to  his  transaction 
record.  These  extra  variables  can  be  easily  added  to  the  transaction 
data  during  data  editing  and  later  used  as  predictor  variables  in  the 
AID  analysis. 

In  summary,  t He  AID  analysis  is  accurate  in  generating  loss 
rates  for  use  in  predicting  losses  if  (1)  all  changes  in  loss  behavior 
are  a  result  of  changes  in  personal  characteristics  of  the  enlisted 
force  as  represented  by  the  variables  contained  in  the  EMF  and  (2) 
there  are  no  changes  in  potentially  predictive  variables  not  contained 
in  the  EMF. 

1.8  Capabilities  and  Limitations 

The  AfD-E  is  designed  to  handle  a  very  large  number  of  vari¬ 
ables  and  of  EMF  records  since  the  AID  statistical  routine  is  capable 
of  analyzing  simultaneously  this  large  quantity  of  information  in  search¬ 
ing  for  the  variables  most  predictive  of  loss  behavior.  The  only- 
limit  on  the  number  of  variables  which  can  be  used  is  the  content  of 
the  EMF  itself. 

The  AID-E  is  designed  to  generate  loss  rates  in  any  conceivable 
format  with  accuracy  and  consistency. 

The  AID  analysis  is  best  used  to  produce  annual  loss  rates. 

(  sing  the  AID  routine  to  produce  loss  rates  covering  a  shorter  period 
is  not  recommended  for  both  practical  and  statistical  considerations. 

This  limitation  is  considerable  in  view  of  the  fact  that  most  protection 
models  use  monthly  or  quarterly  loss  rates.  A  great  deal  of  care 
must  be  exercised  in  converting  annual  rates  into  monthly  or  quarter¬ 
ly  rates.  In  order  to  preserve  consistency  in  t  he  use  of  AID  loss  rates 

Estimation  of  the  values  of  such  variables  is  a  major  research 
task  in  itself. 


in  monthly  and  quarterly  projection  models,  the  Army  cannot  be  too 
strongly  urged  to  designate  an  official  agency  in  charge  of  converting 
AID's  annual  loss  rates  into  monthly  and  quarterly  rates. 

The  AID-E  loss  rates  do  not  agree  with  loss  rates  derived  from 
the  DCSPER-46  Report.  However,  the  AID-E  rates  are  correct  for 
the  CIM,  PIA,  MAIL  and  other  models  as  long  as  these  models  add 
new  accessions  after  losses  are  taken  out  from  the  beginning  inventory. 
In  fact,  the  loss  rates  developed  by  the  AID-E  take  into  account  the 
Ist-month  men,  2nd-men,  3rd-month  men  and  so  on.  Therefore, 
the  AID-E  rates  serve  these  models. 

In  order  to  make  AID-E  estimated  losses  agree  with  rates 
derived  from  the  DCSPER-46  Report,  the  following  modification 
on  the  AID-E  must  be  added:  New  accessions  during  the  fiscal  year 
should  be  added  to  the  EMF  snapshot  taken  at  the  beginning  of  the  fiscal 
year  for  the  AID-E  analysis.  This  augmented  EMF  inventory  would 
then  be  analyzed  by  AID-E  producing  loss  rates.  However,  these 
modified  loss  rates  will  not  be  correct  for  the  CIM,  PIA,  and  MAIL 
models  that  AID-E  was  originally  designed. 

1.  D  Resources  Required  to  Operate  the  AID-E 

The  AID-E  requires  extensive  investment  in  (a)  analyst's  and 
system  operator's  time  and  (b)  computer  facilities  and  supplies, 
lie  low  is  a  list  of  the  estimated  cost  of  various  resources  necessary 
to  run  the  AID-E  to  predict  loss  rates  and  number  of  losses  for  each 
of  the  enlistment  categories  and  loss  causes  listed  in  TABLE  2  for 
use  in  the  CIM-E  model: 

Analyst  2  man-days 

System  Operator  5-10  man-days 

ppp  375-575  minutes 

Magnetic  Tapes  7  reels 

Other  (Clerical  and  consulting,  etc.)  5  man-days 

Assuming  this  run  is  repeated  four  times  a  year  and  loss  rates  re¬ 
quired  by  other  models  are  also  generated  regularly,  the  system  is 
obviously  expensive.  However,  the  budgetary  savings  which  will 
result  from  (a)  more  accurate  manpower  projections  which  reduce 
costly  errors  in  manpower  management  and  (b)  use  of  one  accurate 
set  of  loss  rates  for  all  models  will  far  outweigh  these  expenses. 
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I.  10 


Problems  Encountered 


During  the  development  of  AID-1’,  in  nmtnal  technical 
problems  were  encountered  su<  h  as  compim-.  imiiiations  (e.  g.  , 
core  capacity,  availability  of  CPE  lime.  etc.  ),  the  selection  of 
predictors  and  the  "best''  statistical  approach  hut  these  types  of 
problems  were  solved.  One  problem  the  use-  of  AID-E  will  encounter 
in  supplying  loss  rates  to  other  models  is  a  <  lear  concise  definition 
of  populations  the  loss  rates  nr*,  applied  in  The  logic  of  each  man¬ 
power  model  is  different  and  <  onseqnent  1  s  a  cpuires  different  loss 
rates.  For  example,  loss  rates  lor  a  particular  model  maybe 
applied  to  the  enlisted  population  ..:t>-i  i  ci  r-.. -units  have  been  sub¬ 
tracted  out  of  the  population.  In  anothei  model  the  same  types  of 
loss  rates  may  be  applied  to  the  entin  population.  It  is  essential 
that  the  populations  the  rates  nilt  ippl  .  >■  :.n -fully  defined, 

1.  U  Future  Areas  of  Research 


The  AID-E  system  is  nm  om 
rates,  but  is  also  an  excellent  system  1 
tion  for  various  Army  manpower  stuuu  . 
data  elements  contained  on  magnetic  lap. 
determine,  for  example,  the  inflm-m  iu; 
mismatch.  Studies  can  be  performed 
of  individuals  in  each  MOS  and  identifv  v 
desirable  to  retain  an  individual  in  an  p. 
origin,  marital  status,  pcrformaiu  »  n  i 
other  factors  complicate  manpmv  i 
factors  choosing  the  important  .  , 
and  uses  these  characteristics  ,u-  m.- 
be  made.  Skillful  use  of  AID-E  can  «■<  : 

areas  other  than  loss  predict  im.s. 


i  i  1  ■ 1  , .  pi  educing  loss 

I  >  immediate  informa- 
V.  ■'  i  'lie  large  number  of 
:  tom  can  be  used  to 
m!i  cause  MOS 
no  tui  racteristics 
,  .  i  i«  rist ics  are 

i  ,  vl- i  leographical 
\.  and  many 
i  i  \!  I  >  -E  sorts  these 
uii  t>pe  of  study 
i •! .  ■  ach  j< i  -dictions  can 

i  ■  i  nei'it  the  Army  in 
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Chapter  2 


BASIC  CONCEPT  OF  THE  AID-  E 
2.  1  Statistical  Aspect 

Central  to  the  AID  is  a  statistical  technique  known  as  the 
Automatic  Interaction  Detector  (AID)-,  developed  by  the  Survey 
Research  Center  of  the  Institute  for  Social  Research  at  the  University 
of  Michigan.  Given  a  list  of  explanatory  (or  predictor)  variables, 
the  AID  ranks  these  variables,  one  at  a  time,  in  order  of  their 
importance  in  explaining  loss  behavior.  This  is  accomplished  in  a 
series  of  steps  using  binary  splits.  At  each  step  the  population  is 
divided  into  all  conceivable  pairs  of  two  mutually  exclusive  sub¬ 
groups  for  each  predictor  variable.  After  examining  all  such  pairs 
for  all  variables,  the  pair  which  accounts  for  the  greatest  amount 
of  variance  from  the  average  loss  rate  for  that  particular  population 
group  is  chosen  for  the  binary  split.  The  AID  then  searches  for  the 
next  most  important  split  in  the  same  manner  as  before.  The  AID 
stops  splitting  when  there  are  no  further  subgroupings  which  can 
explain  a  statistically  significant  amount  of  variance  within  the 
limit  set  by  the  analyst.  At  each  step,  the  AID  provides  the  loss 
probability  for  each  subgroup  as  well  as  the  statistical  significance 
of  the  split. 

2.  2  Logical  Aspect 

The  aid  is  a  logical  way  of  analyzing  losses  for  several 
reasons. 


First,  the  AID  groups  a  population  into  several  subgroups 
according  to  homogeneity  of  personal  and  circumstantial  character¬ 
istics  which  influence  loss  and  reenlistment  behavior. 

Second,  the  AID  is  receptive  to  changes  in  Army  strength 
and  composition  and  characteristics  of  the  force  (e.  g. ,  educational 
level)  over  time.  Such  changes  cause  both  growth  and  decay  of 
certain  population  subgroups,  affecting  overall  losses.  The  impact 
of  these  changes  is  automatically  measured  by  the  AID. 

Third,  the  AID  per  mits  logical  adjustment  of  rates  where  it 
is  known  that  future  experience  will  be  different  from  past  experience. 
For  instance,  if  a  three  month  early  release  program  is  planned  for 
first  term  enlisted  men  in  the  coming  year,  it  is  possible  to  isolate 
the  particular  AID  subgroups  of  men  who  are  first-termers  and  whose 

-All  other  modules  (or  subsystems)  in  the  AID  are  designed 
either  to  prepare  the  data  needed  for  the  AID  or  to  summarize  (and 
generate)  the  AID  results  into  suitable  matrices  for  use  in  Army 
projection  models. 


ETS  dates  lie  within  the  thirteenth  to  the  fifteenth  months,  and  to 
adjust  their  loss  rates  into  line  with  similar  groups  whose  ETS  date 
is  within  12  months.  No  changes  have  to  be  made  to  other  groups. 

Fourth,  the  AID-E  provides  a  useful  basis  for  understanding 
why  losses  are  occurring  and  for  developing  remedial  policies  to  influence 
particular  types  of  losses.  Knowledge  of  characteristics 
of  each  group  is  helpful  in  identifying  specific  target  populations  for 
special  remedial  policies.  Special  AID  analysis  can  be  made  to 
study  specific  types  of  loss  that  are  usually  not  separated  for  prediction 
purposes. 

Fifth,  the  ATD-F  can  handle  large  quantities  of  data,  large 
numbers  of  variables,  and  variables  which  are  expressed  in  non- 
quantifiable  categories.  These  features  permit  the  analyst  to  search 
for  all  available  information  with  regard  to  loss  and  reenlistment 
behavior. 

2.  3  Overview  of  the  ATD-F 

Shown  in  FIGURE  1  is  a  graphical  representation  of  the 
AID-E.  The  system,  verbally  described,  starts  with  two  basic 
sources  of  data.  The  first  is  an  EMF  extract  which  contains  approx¬ 
imately  40  elements  of  information  for  each  enlisted  man  at  a 
specific  point  in  time.  The  second  source  of  data  is  an  extract  of 
Gain /Loss  Transaction  Files.  This  extract  contains  approximately 
10  elements  of  information  on  gain/loss  experience  of  each  enlisted 
man  during  a  specified  time  period.  *  These  two  data  extracts  are 
edited  in  a  form  acceptable  for  AID  statistical  analysis  by  two 
computer  programs--the  EMF  Edit  Program  and  Gain/Loss  Edit 
Program. 

The  two  edited  extracts  are  then  merged  by  the  Merge  Program, 
producing  a  single  data  file  referred  to  as  the  Enlisted  Historical 
File.  From  this  file  containing  records  for  each  enlisted  man,  the 
analyst  can  choose  an  enlistment  category. *  Throughout  the  stages 
of  data  preparation,  reports  are  produced  with  error  counts  (along 
with  diagnosis  of  errors)  which  evaluate  and  summarize  data  quality. 

:;:The  time  period  chosen  for  the  AID  analysis  is  usually  a 
year,  but  it  can  be  varied.  The  important  thing  is  that  the  beginning 
date  of  the  Gain/Loss  File  must  match  the  date  of  the  EMF  extract. 

**The  enlistment  categories  are  First-term  AUS,  First-term  RA, 
and  Careerists.  Alternatively,  careerists  can  be  divided  into  careerists 
eligible  for  retirement  and  careerists  ineligible  for  retirement. 


Next  the  Enlisted  Historical  File  is  analyzed  by  the  AID 
statistical  routine  which  generates  loss  rates  by  enlistment  category 
and  loss  cause.  The  analyst  can  further  recode  the  data  at  the  time 
the  AID  statistical  routine  is  executed  and  he  can  also  exclude  certain 
individual  records  and  variables.  The  primary  output  of  the  AID 
routine  is  a  collection  of  mutually  exclusive  statements  of  personal 
characteristics  along  with  probabilities  of  loss  for  each  statement. 

Kach  statement  is  a  step-by-step  description  of  a  series  of  binary 
splits  made  upon  successive  predictor  variables  arranged  in  order 
of  their  importance  in  explaining  losses.  Since  the  statements  are 
mutually  exclusive,  every  individual  in  the  population  must  belong  to 
one  (and  only  one)  of  these  statements  and  its  associated  loss 
probabiliry.  The  analyst  can  draw  the  collection  of  statements  and 
associated  probabilities  as  an  upside-down  tree,  tracing  each 
statement  from  the  point  at  which  the  trunk  first  splits  (the  most 
important  predictor  variable  for  the  collection)  to  the  tip  of  the 
lowest  branchlet  (the  last  and  least  important  predictor  for  a  particular 
statement).  After  a  critical  review  of  the  AID  tree  is  made  for  logical 
consistency  and  accuracy  of  loss  probabilities,  the  AID  tree  is 
transferred  as  input  to  the  Rate  Generator  Program. 

The  Rate  Generator  Program  then  interrogates  an  extract--  of 
the  current  EMF  (edited  by  the  previously  mentioned  EMF  Edit 
Program),  record  by  record.  The  purpose  of  the  interrogation  is 
(a)  to- compare  the  personal  characteristics  of  each  individual  in  that 
extract  to  the  statements  of  characteristics  which  make  up  the  AID 
tree  and  (b)  to  assign  to  each  individual  the  loss  probability  associated 
with  the  statement  that  describes  him. 

In  order  to  produce  loss  rates  in  a  certain  format,  say  by  grade 
and  years  of  service  (YOS),  the  individual  records  in  the  current  extract 
are  then  sorted  by  grade  and  YOS;  the  loss  probabilities  of  all 
individuals  in  each  grade-YOS  cell  are  summed  over;  and  the  sum  is 
divided  by  the  number  of  the  individuals  in  that  cell  to  obtain  a 
weighted  average  of  loss  Drobabilities.  This  weighted  average  is 
the  loss  probability  for  that  cell.  The  Rate  Generator  Program  can 
produce  a  matrix  of  loss  probabilities  in  any  format  the  analyst  desires 
within  the  limit.  --• 


-  Again,  this  extract  can  be  either  the  entire  enlisted  force  or 
a  random  sample. 

- -  First,  the  variables  chosen  for  the  matrix  must  be  contained 
in  the  EMF  extract.  Second,  the  matrix  accepts  at  maximum  three 
variables  at  a  time. 
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FIGURE  1  -  AN  OVERVIEW  OF  THE  AID-E 


EMF 


i:.mf 


Ldi  l 


Extract  Needed 
Data  Elements 
(by  MILPERCEN) 

Extract 


lid  i  t  Data 

(be  EMF  Edit  Program) 
ed  EMI-'  .  Edited  EMF  Report 


Gain/Loss 
Transaction  File 

Extract  Needed 
Data  Elements 
(by  MILPERCEN) 

-  V 

Gain/Loss  Transaction 
Extract 

Edit  Data 

i  (by  Gain/Loss  Edit 

!  Program) 
v 

Edited  Gain/Loss  Edited 

1  Transaction  File  Gain/Loss 

File  Report 


Merge  Files 
(by  Merge  Program) 


Enlisted  Historical  File  ___  >  Edited  Enlisted 

1  Historical  File 

Report 

Generate  AID 
Trees 
(by  AID 
Statistical 
Routine ) 

AID  Probability  Trees 

(for  enlistment  and  loss  categories) 


Most  Recent  EMF 


Generate 
Loss  Rates 
in  Desired 
Format  (by 
1  Rate 

Generator 

Program) 


Extract  Needed  Data 
Elements  (bv 
MILPERCEN) 


New  EMF  Extract 

Edit  Data 

(by  EMF  Edit  Program) 


New  Edited  EMF  t  - 


r  New  Edited  EMF  Report 


v 

Loss  Rates 

Cards/Tape/Hard  Copy 


f 


11 


2.  4  Method  for  Generating  Loss  Rates 


Two  methods  for  generating  loss  rates  may  be  used:  direct 
and  indirect. 

The  direct  method  is  the  method  by  which  the  AID  routine 
generates  the  ratio  of  the  number  of  losses  from  a  specific  cause  to 
the  number  of  persons  in  the  population.  The  resulting  AID  tree  is 
input  to  the  Rate  Generator  Program.  Loss  rates  in  a  desired  format 
are  produced  by  this  program  in  the  manner  described  at  the  end  of 
the  previous  section. 

The  indirect  method  uses  two  AID  trees.  The  first  tree  is 
produced  by  calculating  the  ratio  of  the  number  of  all  losses  (i.  e. , 
irrespective  of  cause  of  loss)  to  the  number  of  persons  in  the 
population.  The  second  tree  is  produced  by  calculating  the  ratio  of 
the  number  of  losses  from  a  specific  cause  to  all  losses.  The  two 
trees  are  transferred  to  the  Rate  Generator. Program  in  which  the 
following  steps  are  taken  to  generate  loss  rates  in  a  desired  format. 
The  characteristics  of  individuals  in  the  current  KMF  are  compared 
with  the  statements  of  personal  characteristics  which  make  up  the 
two  AID  trees,  and  each  individual  is  assigned  to  the  loss  rate 
associated  with  the  two  statements  (one  from  each  tree)  which  describe 
him.  The  two  rates  are  then  multiplied  to  produce  a  single  rate  for 
ear h  individual.  When  this  multiplication  is  done  for  all  individuals, 
the  loss  rate  for  a  cell  specified  by  a  loss  format  is  generated  by 
summing  over  the  loss  probabilities  of  all  persons  in  that  cell  and  the 
sum  is  divided  by  the  number  of  persons  in  that  cell. 

The  choice  of  one  method  over  the  other  depends  upon  statistical 
considerations,  the  number  of  transaction  categories  to  be  analyzed, 
and  the  amount  of  computer  time  required.  For  example,  the  indirect 
method  should  be  chosen  when  several  loss  causes  are  being  analyzed 
and  (tie  number  of  losses  from  eacli  cause  is  extremely  small  in 
comparison  to  the  population  being  studied  (since  the  indirect  method 
produces  a  higher  ratio  of  losses  to  population  and  therefore  makes 
analysis  more  effective);  when  there*  is  only  one  loss  cause  to  be 
analyzed,  the  direct  method  is  the  only  choice;  and  when  there  are  are 
ninny  loss  causes  to  be  analyzed  and  the  size  of  the  population  is  very 
largo,  the  indirect  method  saves  computer  run  time. 

In  general,  the  reliability  of  the  indirect  method  is  greater  (1) 
because  two  trees  tend  to  produce  more  splits  and  (2)  because  the 
ratios  arc  higher. 


2.  5  The  Variables 


The  AID-E  is  capable  of  analyzing  a  very  large  number  of 
variables  at  the  same  time.  Most  of  them  are  used  as  predictor 
valuables  in  All)  analysis,  but  some  are  used  for  other  purposes. 

For  example,  the  Social  Security  number  is  used  for  sampling  the 
population  for  the  AID  analysis;  the  error  code  variables,  internally 
generated  by  the  EMF  Edit  Program,  are  used  in  controlling  inclusion 
or  exclusion  of  certain  variables  and  individual  records  in  the  AID 
analysis  based  upon  the  quality  of  data;  and  the  MOS  and  CMF  variables 
are  used  for  sorting  population  in  the  Kate  Generator  Program. 

Several  variables  resemble  each  other  and  seem  redundant, 
but  they  serve  useful  purposes  such  as  reconstructing  missing  data 
and  cross-checking  the  validity  of  certain  variables. 

Chapter  3  contains  sample  results  of  AID  analysis  of  actual 
Arm)  data.  In  these  results,  some  variables  appear  more  often 
‘ban  others  as  loss  predictors.  The  dominance  of  these  variables 
should  not  establish  the  permanency  of  their  importance.  Certain 
o'her  variables  proved  to  be  more  important  during  a  particular 
time  period,  becoming  less  important  in  other  periods.  It  should 
also  be  noticed  that  some  potentially  useful  variables  were  deliberately 
oinif'ed  from  flie  anal  ysis  reported  in  Chapter  3  due  to  poor  quality 
of  the  data.  It  is  quite  possible  that,  as  the  quality  of  data  improves 
o\er  time,  some  of  the  presently  loss  predictive  variables  may  become 
more  useful.  TABLE  1  lists  the  variables  included  in  the  AID-E. 

2.  fi  Causes  of  Loss  by  Enlistment  Category 

Numerous  computer  runs  of  the  All)  were  made  using  actual 
da'a  to  determine  ways  to  improve  reliability  of  the  loss  projection 
s>  stem,  particularly  by  subgrouping  losses  by  enlistment  category 
and  cause  of  loss. 

APer  careful  analysis  of  these  inns,  the  Army  and  TEMPO 
agreed  to  analyze  losses  by  the  transaction  categoi'ies  (groups  of 
loss  causes)  and  enlistment  categories  listed  in  TABLE  2.  The 
results  of  this  analysis  were  used  to  generate  a  variety  of  loss  rates 
in  formats  inquired  by  the  C1JV1-E,  PIA,  and  MAIL  models.  However, 
it  should  be  borne  in  mind  that  the  AID-E,  as  noted  earlier  in  this 
report,  is  capable  of  generating  loss  rates  in  virtually  any  format 
needed  by  Army  manpower  planners. 
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TABLE  1 


V  ariable 

LIST  OF  VARIABLES  IN  THE 

aid-.: 

Possible 

Number 

Data  Element 

Abbreviation 

Value  Range 

1 

Social  Security  Number 

SSN 

0-9 

2 

Military  Occupational  Specialty 

MOS 

XXXXX 

3 

3  Digit  MOS 

Sex 

3MOS 

0  -  999 

4 

SEX 

0-9 

5 

Age 

AGE 

0  -  999 

6 

Race 

RACE 

0-9 

7 

Marital  Status 

MARITAL 

0-9 

8 

Number  of  Dependents 

DEPENDENTS 

0-9 

n 

G  rade 

GRADE 

0-9 

10 

AFQT  Here  entile  Score 

AFQT 

0  -  99 

11 

Mental  Category 

MENTAL-CAT 

0-9 

12 

Educational  Level 

EDUCATION 

0-9 

13 

Proficiency  Pay  Eligibility 

PRO-PAY 

0-9 

14 

Eligibility  for  VRB 

VRB 

0-9 

15 

Major  Command 

MAJCOM 

0-99 

16 

Functional  Status 

STATUS 

0-9 

17 

Average  Efficiency  index 

A  El 

0-9 

18 

Time  in  Grade 

TIG 

0  -  999 

1!) 

Total  Military  Service 

TMS 

0  -  999 

20 

Total  Active  Military  Service 

TAS 

0  -  999 

21 

Expiration  of  Term  of  Service 

ETS 

0  -  99 

22 

28 

Terr  of  Enlistment 

Date  of  Estimated  Return 

TERMS 

0-99 

from  ( iverseas 

DEROS 

0  -  99 

24 

Career  Management  Field 

CM  F 

0-99 

25 

Enlistment  Category 

ECAT 

0-9 

26 

Physical  Profile 

PHYSICAL 

0-9 

27 

Residence  at  Entry 

RESIDENCE 

o 

1 

'O 

28 

MOS  Mismatch 

MOS /.I  OB 

0-9 

20 

Promotion  MOS 

PROMOTION 

0-9 

2.0 

Incidence  of  AWOI. 

AWOL 

0  -  9 

31 

TMS  Error  Code 

El 

0-9 

32 

TAS  Error  Code 

E2 

0-9 

33 

Grade  Error  Code 

E3 

0-9 

34 

MOS  Error  Code 

El 

0-9 

35 

TIG  Error  Code 

E5 

0-9 

36 

ETS  Error  Code- 

E6 

0-9 

37 

Terms  of  Service  Error  Code 

E7 

0-9 

38 

DEROS  Error  Code 

E8 

0-9 

30 

Transaction  Category 

TRANSACTION 

0  -  99 

40 

Process  Date  of  Transaction 

PROCESS 

0  -  9999 

41 

42 

Effective  Date  of  Transaction 
Term  of  Reenlistment  or 

EFFECTIVE 

0  -  9999 

43 

Extension 

Transaction  Date  minus 

REUP/EXT  TERM 

0  -  99!* 

EMF  Date 

14 

TRANS -EMF 

0  -  99 

•  '  «'  f 


TABLE  2 


TRANSACTION  AND  ENLISTMENT  CATEGORIES  BY  SPN 


TRANSACTION 


CATEGORY  &  CODE 

FT-AUS 

FT- 

O.  '  Miscellaneous 

Transactions 

1.  DFMC 

X 

X 

2.  Retirement 

- 

- 

3.  Physical 

X 

X 

l  >i  sqnali  fixation 

4.  Hardship/ 

X 

X 

.  *  Dependency 

1 

5.  Unfitness 

X 

X 

6.  Unsuitability 

X 

X 

7.  Misconduct 

X 

X 

8.  ETS 

X 

X 

9.  Reenlistment 

_ 

X 

Losses 

10.  Early  Release 

X 

X 

ll.  Other  Losses 

X 

X 

CAREERIST  SPN 


SPN  numbers  are  de- 
X  signated  "FOR  OFFICIAL 

USE  ONLY"  and  hence  are 
not  listed  in  this  vol¬ 
ume,  Thev  mav  he  found 
on  pages  U2-83  of  Volume 
tl  Manual  for  Planner/ 

•  User. 


X 


X 


X 


X 


X 


X 


X 
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TABLE  2  (continued) 


TRANSACTION 
CATEGORY  &  CODE 

FT-AUS 

FT-RA 

CAREERIST  SPN 

12. 

Return  to 
Military 

Control 

X 

X 

X 

13. 

Extension 

X 

X 

X 

14. 

Reenlistment 

Gains 

X 

X 

99.  New  Accessions 


X 


X 


2.  7  Use  of  the  Loss  Rates 

In  generating  loss  rates  for  use  in  different  projection  models, 
the  user  of  AID- I'  must  clearly  understand  the  idiosyncrasies  of 
each  model  with  respect  to  (1)  the  time  dimension  of  the  loss  rates  it 
requires  (e.  g. ,  annual  rate,  monthly  rates,  etc.),  (2)  its  logic  and 
simulation  process,  (3)  its  definitions  of  loss  rates  in  the  total 
population  and  in  loss  categories  (i.  e. ,  SPN's),  and  (4)  the  dimensions 
of  the  loss  format  it  requires.  Failure  to  understand  any  of  these 
idiosyncracies  will  result  in  the  generation  of  rates  which  are  completely 
useless. 

The  CljVI-K  model,  for  example,  is  a  monthlv  projection  model 
which  requires,  by  definition,  monthly  loss  rates.  CIM's  logic  for 
loss  calculation  is  in  the  following  sequence:  To  a  monthly  beginning 
inventory  the  attrition  loss  (Transaction  Categories  1  through  7  and 
10  to  11  in  TAISLF  2)  are  applied  and  the  resulting  attritions  are  taken 
out  from  the  inventory,  thus  reducing  the  inventory.  Now,  to  this 
remaining  population,  the  UTS  loss  rates  (Transaction  Category  8  for 
the  first-termers  end  careerists  ineligible  for  retirement)  are  applied 
and  the  further  reduction  of  the  population  caused  by  the  FTS  losses 
is  made.  If  there  are  any  further  losses  (e.  g.  ,  special  r  eleases), 
they  are  also  taken  out  after'  the  UTS  losses.  All  of  the  above 
processes  in  the  CIM  require  defining  and  redefining  of  successive 
populations  through  use  of  SPN’s  for  the  purpose  of  generating 
appropriate  loss  rates.  Another  projection  model  may  not  necessitate 
such  changes  in  the  population  base  if  the  logic  of  that  model  demands 
a  simultaneous  take-out  of  all  losses  from  the  beginning  inventory. 

Finally,  annual  loss  rates  generated  by  the  AID- I  '  must  be 
converted  to  monthly  or  quarterly  rates  with  appropriate  seasonality. 
This  conversion  should  be  made  by  experts  who  not  only  know  past 
experience  but  also  can  accurately  anticipate  month-to-month  future 
policies  of  the  Army  which  may  affect  losses. 
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TABLE  3 

ENLISTED  TRANSACTION  CATEGORIES 


AID  Transaction 

T  ransnction 

SPN 

C^t 

egory 

C  'at  egory 

1. 

DFMC 

DFMC 

SPN  numbers  are  de¬ 
signated  "FOR  OFFICy'L 
USE  ONLY"  and  hence 

2. 

Retirement 

Retired 

are  not  listed  in 

(Non-D) 

this  volume.  They 

3. 

Disability  Ret¬ 
irement 

Disability 

Retirement 

may  be  found  on  pages 
163-164  of  Volume  IT 
Manual  for  i launer/Uner 

4. 

Physical  Dis¬ 

Physical  Dis¬ 

qualification 

qualification 

5. 

Hardship /Dep¬ 
endency 

Hardship 

6. 

Unfitness 

Unfitness 

7.  Unsuitability 


8.  Misconduct 


9.  School  &  Teach¬ 
ing 

10.  Early  Release 
Res/NG 

11.  Other  Early 
Release 


12.  ETS 

13.  Other 


14.  Reenlistment 
(immediate) 


Unsuitability 


Misconduct 

Punitive  Dis¬ 
charge  Good  of 
Service 

School  &  Teach¬ 
ing 

Early  Release 
Res/NG 
Early  Release 

Employment 

OS  Early  Release 

ETS 

Death 

TBOWO 

Failure  of  Promot 
ion 

Both  Losses  and 
Gains 
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TABLE  *  (Continued) 


AH3  Transaction 
Category 


Transaction 

Category 


15. 

16. 

17. 


Research 

Reenlistment  (non- 
immediate) 
Returned  to  Milit¬ 
ary  Control 


Other  Losses 
Gains 

Gains 


1!) 


SPN 

All  other  &  blank 


Chapter  3 


APPLICATION  OP  THE  AID-E 
INTERPRETATION  OF  RESULTS 

3.  1  Introduction 

This  chapter  summarizes  and  interprets  some  of  the  results 
(including  prediced  vs.  actual  losses)  produced  by  the  AID-E.  The 
reader  should  note  that  losses  predicted  by  AID-E  for  FY  1973  vary 
from  actual  losses  during  this  period  by  less  than  1.  5%  and  losses 
predicted  for  FY  1970  vary  from  actual  losses  by  less  than  3.3%. 
When  the  loss  rates  developed  from  FY  70  data  were  applied  to 
FY  73  EMF,  the  margin  of  error  was  less  than  2.  4%. 

Volume  II  of  this  report  discusses  the  formulae  used  for  the 
All)  statistics,  the  meanings  of  these  formulae,  and  the  options 
available  in  running  the  AID  routine.  Chapter  4  of  Volume  II  includes 
appropriate  comments  and  suggestions  on  sample  size,  sampling 
errors,  bias  caused  by  deleting  (or  including)  variables  having 
errors,  and  methods  of  generating  loss  rates. 

3.2  Data  Used  for  Sample  Outputs 

For  purposes  of  testing  the  reliability  and  validity  of  the  AID-E 
both  F'i  s  1970  and  1973  were  used.  For  F Y70  data,  the  date  for  the 
EMF  Extract  is  as  of  duly  1,  196!)  and  the  time  period  covered  for 
the  extract  of  the  Oain/Loss  Transaction  is  from  the  beginning  of 
•  luly  1969  to  the  end  of  June  1970.  For  F Y 7 3  data,  the  date  for  the 
EMF  is  as  of  duly  1,  1972,  and  the  time  period  for  the  extract  of  the 
Cain/ Loss  File  is  from  (he  beginning  of  duly  1872  to  the  end  of  June 
1973.  In  fact,  (IE-TEMPO  used  FY70  data  primarily  as  a  test  and 
experimental  data.  In  this  chapter,  the  results  of  the  AID  runs  using 
both  FY70  and  7.3  data  are  presented. 

3.  3  Results  of  AID  Analysis  Using  FY70  Data 

TAULE  3  lists  the  enlisted  transaction  categories  for  which 
FY70  data  was  used  by  the  AID. 

3.  3.  1  All  Losses  (Based  on  FY70  Data) 

FIGURES  2  through  5  are  the  AID  trees  drawn  to  reflect  loss 
probabilities  for  four  enlistment  categories  developed  by  the  AID 
from  a  sample  of  the  FY1970  enlisted  population.  The  trees  appear 
in  the  following  order:  FT-AUS,  FT-RA,  Careerists  Ineligible  for 
Retirement,  and  Careerists  Eligible  for  Retirement. 
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In  order  to  explore  the  degree  to  which  losses  predicted  on  the 
basis  of  these  AID-developed  probabilities  approach  actual  loss 
experience  during  FY70,  a  second  sample  of  the  enlisted  population 
in  FY70  was  drawn  and  the  persons  in  this  sample  classified  accord¬ 
ing  to  the  sub-populations  (clusters  of  personal  characteristics) 
into  which  AID  grouped  the  first  sample.  The  loss  probability  for 
each  sub-population  generated  by  AID  from  the  first  sample  was 
then  applied  to  the  number  of  persons  in  the  second  sample  described 
by  that  sub-population  to  obtain  "predicted  losses"  for  each  sub¬ 
population.  The  second  sample  was  then  reviewed  again  to  deter¬ 
mine'  the  number  of  persons  in  that  sample  who  were  actually  lost 
in  the  course  of  FY70. 

h  mails  the  MD  trees  developed  from  F N  70  data  were  used 
to  predict  losses  in  F\  73.  The  results  art'  presented  in  Section  .3.  5. 

I  he  results  show  that  A1D-F  is  very  reliable  in  predicting  losses. 
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FIGURE  3  -  AID  TREE  FOR  FT-RA  " 


Loss  Type:  All  losses 
Transactions:  1-13,  15 
EMF  Extract  Date: 


27  mo 


N  =  Population 
L  =  Losses 

P  *  Loss  probabilities 


N=5882 
L=3624 
P= .  61 61 


N=10930 

L=9818 

P=.8983 


FIGURE  4.  AID  TREE  FOR  CAREERISTS 
.  INELIGIBLE  FOR  RETIREMENT  * 

(FY  1970) 


N  -  Population 
L  =  Losses 

P  =  Loss  probabil it ies 

See  Appendix  for  definitions  of  vrriables. 
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FIGURE  5  -  AID  TREE  FOR  CAREERIST 5 

ELIGIBLE  FOR  RETIREMENT* 


N  =  Population 
L  =  Losses 

P  =  Loss  probabilities 


See  'ppondix  for  definitions  of  var  iables 
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TABLES  4  through  7  compare  these  actual  and  predicted 
losses  in  each  sub-population  defined  by  AID,  as  reflected  in  the 
trees  shown  in  FIGURES  2  through  5.  The  transaction  categories 
used  in  the  AID  trees  and  TABLES  4  through  7  are  defined  in 
TABLE  3.  A  few  explanatory  notes  for  these  tables  are  made 
below : 

(a)  The  "BSS/TSS"  is  that  proportion  of  the  total  sum  of 
squares  explained  by  a  given  number  of  AID  splits 
(or  sub-grouping)--a  kind  of  correlation  statistics. 

(b)  The  MEAN''  is  the  overall  loss  rate  for  the  entire 
sample. 

(c)  The  "Cell  Number"'  listed  in  each  box  of  the  second 
column  of  the  tables  corresponds  to  the  AID  tree  cell 
numbers  as  shown  in  FIGURES  2  through  5. 

(d)  The  '"Loss  Probability"  in  the  third  column  shows  the 
probability  attached  to  a  terminal  cell  or  to  the 
characteristics  belonging  to  those  in  the  second 
column. 

(e)  The  "Number  of  Persons"  in  the  fourth  column 
indicates  the  number  of  persons  in  the  second  sample 
who  belong  to  the  characteristics  listed  in  the 
corresponding  box  in  the  second  column. 

(f)  The  "Predicted  Losses"  is  the  number  of  losses 
calculated  by  multiplying  Column  3  by  Column  4. 

(g)  The  "Actual  Losses"  is  the  actual  number  of  losses 
incurred  to  the  population  in  the  second  sample  whose 
characteristics  belong  to  those  in  Column  4. 

(h)  The  "Predicted  Overall  Loss  Rate"  at  the  bottom  is  the 
ratio  of  the  total  in  Column  5  to  that  in  Column  4. 

(i)  The  "Actual  Overall  Loss  Rate  at  bottom  is  the  ratio 
of  the  total  in  Column  6  to  that  in  Column  4. 
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TABLE  4 


COMPARISON  OF  ACTUAL  AND  PREDICTED  LOSSES  (OR  GAINS) 
ENLISTMENT  CATEGORY:  FT-AUS  (FY70) 

t 

TRANSACTION  CATEGORY:  All  losses  (1-13,  15) 

AID  RUN  DATE:  7/26/73 
AID  RUN  NUMBER:  E-7 
SAMPLE  SIZE  FOR  AID  RUN: 

Random  or  Not  Random 

Sample  Number  and  Size:  #1;  47,153  (10  percent) 

SAMPLE  SIZE  FOR  COMPUTING  ACTUAL  LOSSES  (OR  GAINS  ) 

Comparison  Run  Date:  8/30/73 

Random  or  Not  Random  * ** *** 

Sample  Number  and  Size:  #2;  47,220  (10  percent) 

BSS/TSS:  .631 
MEAN:  .5724 


(1) 

(2) 

(3) 

(4) 

(5) 

(6) 

(7) 

Terminal 

Cell 

Numbe  r 

Cell 

Number 

Loss 

Probability 

Number 

of 

Pe  rsons 

Predicted 

Losses 

Actual 

Losses 

Percent 

Error 

1 

4 

.  0892 

15,  682 

1396 

1357 

-3.  1% 

2 

8 

.  2  485 

1.  981 

493 

478 

-2.  9 

*» 

.  j 

9 

.  5336 

2.  672 

1426 

1403 

-1.  6 

4 

6 

.  7073 

3,  804 

2689 

2653 

-1  4 

5 

7 

.  9489 

23,  081 

21904 

21872 

-0.  1 

TOTAL 

47, 220 

27,  908 

27.  783 

-0.  3% 

Predicted  Overall  Loss  Rate  *  .5910 
Actual  Overall  Loss  Rate  =  .5884 

*Actual  -  Predicted 
Actual 

**Major  Command  -  16,  5,  8,  6,  7,  12,  2,  1,  3,  13,  15 

***Major  Command  =  14,  4,  10,  9,  0 


TABLE-  5 


COMPARISON  OF  ACTUAL  AND  PREDICTED  LOSSES  (OR  GAINS) 
ENLISTMENT  CATEGORY:  FT-RA  (FY70) 

t 

TRANSACTION  CATEGORY:  All  losses  (1-13,  15) 

AID  RUN  DATE:  August  6,  1973 
AID  RUN  NUMBER:  E4 
SAMPLE  SIZE  OFRAID  RUN: 

Random  or  Not  Random 

Sample  Number  and  Size:  Sample  #1;  55,705  (10%) 

SAMPLE  SIZE  FOR  COMPUTING  ACTUAL  LOSSES  (OR  GAINS): 

Comparison  Run  Date:  August  29,  1973 
Random  or  Not  Random 


Sample  Number  and  Size: 

Sample 

It 2;  55,427 

(10%) 

BSS/TSS : 

.416 

MEAN:  . 

3516 

'  1 ) 

(2) 

(3) 

‘4> 

(5) 

(6) 

(7) 

To  rn'.  i  n;i 

1  Cell 

1  ,(>SS 

\umbo  r 

Predicted 

!  Actual 

(Percent  ‘ 

CHI 

Xiimbc  v 

Probabi  t  ily 

of 

1  .nsses 

Losses 

•  E r ro  r  ! 

\u  mbo  r 

1  V  rsons 

j  _ : 

1 

2 

1520 

38805 

5898 

5850 

-0.  8% 

2 

4 

6161 

5912 

3642 

3694 

1  4 

3 

5 

8084 

10710 

9622 

9764 

1  5 

TOTAL 

65 

.  427 

19.162 

19, 308 

o .  7% ; 

Predicted 

Overall  Loss  Rate  =  .346 

Actual  Overall  Loss 

Rate  =  .348 

♦Actual  - 

Predicted 

100 

Actual 

TABLE  6 


COMPARISON  OF  ACTUAL  AND  PREDICTED  LOSSES  (OR  GAINS) 

ENLISTMENT  CATEGORY: 

CAREERIST  INELIGIBLE  FOR  RETIREMENT  (FY70) 

t 


TRANSACTION  CATEGORY:  All  losses. 

AID  RUN  DATE:  August  6,  1973 
AID  RUN  NUMBER:  E2 
SAMPLE  SIZE  FOR  AID  RUN: 

Random  or  Not  Random 

Sample  Number  and  Size:  Sample  #1;  23,027  (10%) 
SAMPLE  SIZE  FOR  COMPUTING  ACTUAL  LOSSES  (OR  GAINS): 
Comparison  Run  Date:  August  29,  1973 
Random  or  Not  Random 

Sample  Number  and  Size:  Sample  # 3  -  20% 

BSS/TSS :  .413 

MEAN:  .1604 


_ oi _ m _ m _ (4) 

Terminal  Cell  Loss  Number- 

Cell  Number-  Probability  of 
Number  Persons 


8  . 0476  30805 


.  _(5) _ (6) _ JJ)_ 

Predicted  Actual  Percent 
Losses  Losses  Error 


1466  1569  6.6% 


826 


1101 


4550 


1  5 

7 

.  3542 

1977 

700 

700 

0.  0 

6 

10 

.  5817 

2041 

1187 

1141 

I 

° 

7 

11 

.  8560 

3306 

2830 

2782 

-1.  7 

TOTAL 

44,  606 

7213 

7231 

0.  3% 

Predicted  Overall  Loss  Rate  =  .1617 

Actual  Overall  Loss  Rate  ■  .1621 

*Actual  -  Predicted 

Actus!  2Q 


TABLE  7 

COMPARISON  OF  ACTUAL  AND  PREDICTED  LOSSES  (OR  GAINS) 
ENLISTMENT  CATEGORY: 

CAREERISTS  ELIGIBLE  FOR  RETIREMENT(FY70) 


TRANSACTION  CATEGORY:  All  losses  (1-13,  15) 

AID  RUN  DATE:  August  6,  1973 
AID  RUN  NUMBER:  E9 
SAMPLE  SIZE  FOR  AID  RUN: 

Random  or  Not  Random 

Sample  Number  and  Size:  Sample  #1;  4248  (10%) 
SAMPLE  SIZE  FOR  COMPUTING  ACTUAL  LOSSES  (OR  GAINS): 

Comparison  Run  Date:  August  29,  1973 

Random  or  Not  Random 

Sample  Number  and  Size:  Sample  // 2;  4355  (10%) 
BSS/TSS :  .052 

MEAN:  .3380 


(1) 

(2) 

(3) 

(4) 

(5) 

(6) 

(7) 

To  rmina  1 

Coll 

'Number 

Coll 

Numbor  >  1 

1  ,oss 
’robnbi 

Number' 
lily  of 

IV  rsons 

Predicted 

Losses 

Actual 

Losses 

Per-cent : 

:  Error 

l 

1 

4 

2163 

1433 

310 

316 

i  1.  9% 

2 

,r) 

.  3  340 

105!) 

354 

381 

7.  1 

3 

6 

.  3251 

78! 

254 

262 

3.  I 

4 

7 

.  502  3 

1082 

543 

555 

2.  2 

,  j 

TOTAL 

4355 

1461 

1514 

3.  5";, 

Predicted 

Overall  Loss 

Rate  » 

.3355 

Actual  Overall  Loss  Rate  *  .3476 
*Actual  -  Predicted 


Actual 


100 
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3.  3.  2  Proportion  of  Losses  by  Cause  to  All  Losses  Using  FY  70  Data 


In  Chapter  2,  the  two  methods  for  generating  loss  rates  were 
discussed.  For  some  causes  of  loss,  loss  rates  are  calculated  by 
the  indirect  method.  Given  the  AID  loss  rates  (Li/Pi)*  shown  in 
FIGURES  2  through  5,  the  proportions  of  losses  by  cause  (Lci/Li) 
will  provide  the  loss  rates  for  a  specific  cause  (Lci/Pi)  when  Li/Pi 
is  multiplied  by  Lci/Pi. 

FIGURES  6  through  9  shown  four  examples  of  AID  trees  made 
for  Lci/Li  for  certain  causes:  an  example  is  drawn  from  each  of  the 
four  enlisted  categories.  The  two  sets  of  AID  trees  (one  for  Li/Pi 
and  another  for  Lci/Li)  are  multiplied  in  the  Rate  Generator  Program 
to  produce  the  loss  rates  by  a  specific  cause  (Lci/P).  An  example 
of  this  multiplication  is  given  in  the  ANNOTATED  SAMPLE  PRINTOUT 
OF  RATIO  GENERATOR  PROGRAM  (Appendix  D9)  in  Volume  II. 

The  predicted  losses  in  fact  have  proven  to  be  very  close  to  actual 
losses  using  the  indirect  method. 

3.4  Sample  Results  of  AID  Analysis  Using  FY73  Data 

Figures  10  through  12  are  the  three  AID  trees  drawn  from  the 
three  AID  runs  made  from  FY73  data.  The  transaction  categories 
used  for1  these  All)  i-uns  are  listed  in  TABLE  3.  No  distinction 
is  made  between  careerist-ineligible-for-retirement  and  careerist- 
eligible- for -retirement.  The  two  are  combined  into  a  single 
group:  careerist. 

The  first  three  AID  trees  (FIGURES  lOthrough  13  are  loss  rates 
for  FT-AUS,  FT-RA,  and  Careerist.  The  fourth  tree  (FIGURE  13) 
is  the  AID  run  made  on  the  proportion  of  losses  due  to  retirement  of 
careerists.  When,  for  example,  the  loss  rates  of  careerist  in 
FIGURE  12  are  multiplied  by  the  proportion  of  retirement  losses  in 
FIGURE  13,  the  retirement  loss  rates  are  generated  by  the  indirect 
method  for  various  subpopulations  in  the  careerist  population. 

TABLES  8  through  10  show  the  actual  and  predicted  losses. 


*  Notations:  L=Number  of  Losses;  P  =  Population;  i=Enlistment 
Category;  c=Cause  of  Loss 
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3.  5  Actual  vs.  Predicted  Losses  in  FY  *73  Using  FY  70  Data 

Using  the  AID  trees  in  FIGURES  2-5  foased  on  FY  70  data), 
predicted  losses  are  made  and  compared  with  actual  losses  in  FY  73 
The  results  are  presented  in  TABLES  11-13.  The  prediction  errors 
are  :  1.  1%  for  AUS;  2.  4%  for  first-term  RA;  and  -1.  5%  for  Career¬ 
ists.  Some  of  the  errors  are  attributed  to  the  quality  of  FY  70  data 
and  to  changes  in  definitions  of  several  variables. 

3.6  Usefulness  of  the  A  I  :  -E 

The  Army  and  the  Department  of  Defense  have  a  long  list  of 
manpower  projection  models.  All  of  the  models  need  a  consistent 
set  of  loss  rates.  The  loss  rates  generated  by  the  AID-E,  ,  for 
example,  can  be  used  for  the  CIM,  PIA,  and  MAIL  models.  Also 
the  AID-E  can  be  used  for  generating  various  reports  and  analyses 
for  Army's  special  requirements. 

CiK  has  already  used  the  system  to  make  a  brief  (two-day)  study 
of  the  problem  of  the  Army's  MOS  imbalances.  MOS-mismatch  is 
a  considerable  cause  of  the  Army's  inefficiency  in  using  its  most 
valued  resource:  manpower.  The  objective  of  GE's  brief  study 
was  to  identify  the  characteristics  of  an  enlisted  individual  which 
"best"  explain  why  he  is  an  MOS-mismatch  or  loss  to  the  Army. 

The  problem  deserves  additional,  careful  analysis.  In  fact,  the 
AID-10  is  a  "generalized  information  generating  system". 
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FIGURE  6-  AID  TREE  FOR  FT-AUS  (L3,  13/L) 

TP 


Loss  Type:  Disability 
Retirement  and 
Other  Losses 
Transactions: 

3  and  13 


N=55295 

L=1439 

P-.0260 


Gain/ Loss 
Extract 
Period: 
FY70 
Sample 
Size: 
20% 

BSS/TSS: 

.186 


Func. 
Status* 
5, 1,6, 2 


Func. 
Status= 
3,4 


u 

- Q 

N=5044 

N=4851 

L=481 

L=933 

P= . 0095 

P=. 1923 

- r— c - 

\  — q 

- "HE 

- e 

- D 

N=378 

L=79 

P=.  2090 

N=50063 

L=402 

P= . 0080 

N=396 

L=185 

P=.4672 

N=4455 

L=748 

P=.1679 

- /"  v —  J 

Grade  > 


u 

HE 

N=41 20 

N=335 

«71 

L=142 

P=.4239 

Pro- 
ays  0 


N  =  Total  Losses 
L  =  Losses  for  Causes  3  and  i3 
P  =  L/N 


Npprndix  lor  definitions  of  variables 


FIGURE  7  -  AID  TREE  FOR  FT-RA  (L9/L) 


L/N 


See  Appendix  Cor  tk'finil  ions  of  va  i  i;il>l<'s 
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FIGURE  6-  AID  TREE  FOR  CAREER  (LG-8/L)  (FY  1970) 


FIGURE  9  -  AID  TREE  FOR  CAREER- RETIREMENT  (L2.3/L) 


N  =  Total  Losses 
L  =  Losses  for  Cause  2.3 
P  =  L/N 


-See  Appendix  for  definitions  of  •  va  riables. 
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FIGURE  10;  AID  TREE  FOR  FT-AUS 

Loss  Type:  All  losses 
Transactions:  1-8,  10,  11 
KMF  Exlract  Date:  Start  FY73 
Gain/Loss  Extract  Period:  FY73 
Sample  Size:  10u/o 
BSS/TSS:  .649 


"soe  pporid  ix  tor-  dct’inil  ions  of  variables 

N  =  Population 
P  =  Loss  Rate 
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FIGURE  11  -AID  TREE  FOR  FT-RA 

(FY  1973) *  - 

J-.OKH -Typi;:  All  Josses  N-i 

Transactions:  1-8,  10,  11 

KM1-’  Extract  Date:  Start  FY73  P=* 

Gain/Loss  Extract  Period:  FY73  J 

Sample  Size:  10%  / 

BSS/TSS:  .365  13-90  /  E 


N-32359  ^ 
P=.  3139 

’  \ 
ETS 


N- 2  59  3  IS 
P=.  183G 


N=G421 
P=.  0402 


/  Educational 
3-0/  level  \  0-2 


2-0  /  TAS 


9-999 


N=17592 


- P  £ 

N "15590  L  N-1994 
I»=.  1256  P*.  3004 


1  /  Sex  ^  2 
N  14014  N=784 


P".  1150 


Grade 


P.=.  3099 


N-0754 
P-.  0021 


Nr60G0 
P*.  1645 


TAS  \  10-999 


(5 

N  =  0346 

P=.  2G23 

3-7 

Grade 

N-5 

P". 

203 

2047 

N=27 
P=.  1111 


N=6394 
P=.  0433 


N -3143 
P=.  3576 


10-999 

mm 


N=2314 
P-.  2835 


N=829 
P=.  5645 


N=G09  1 

P*.  503G 


N-140 
P=-.  0C4  3 


"See  Appendix  for  definitions  of  variables. 


N  -  5725 
P\  1521 


N=335 

P=.  3701 

17 

1 _ J 

N  =  Population 
P  =  Loss  Rate 


F1GUIIE  13-  AID  TIRE  TOR  CAR1XJUST  (LR/L) 
(FY  1973)  * 


N  =  All  losses 
p  s  Losses  by  Retirement 
A1 1  Losses 


'  Sit  \ppcndix  for  definitions  of  variables.  , 
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TABLE  8 

COMPARISON  OF  ACTUAL  AND  PREDICTED  LOSSES  (OR  GAINS) 
ENLISTMENT  CATEGORY:  FT-AUS  (FY73) 


TRANSACTION  CATEGORY:  (L/P)  All  losses 
AID  RUN  DATE:  15  November  1973 
AID  RUN  NUMBER:  L/P 
SAMPLE  SIZE  FOR  AID  RUN: 

Random  or  Not  Random 

Sample  Number;  Size;  No.  of  Obs.  Read  in;  Sample  #5,  10%  (9797) 

SAMPLE  SIZE  FOR  COMPUTING  ACTUAL  LOSSES  (OR  GAINS) 
Comparison  Run  Date:  30  November  1973 
Random  or  Not  Random 
Sample  Number  &  Size:  Sample  #7;  10% 

BSS/TSS:  .649 

MEAN:  .7611 


Cell  Number 

Loss 

Probability 

Number 

of 

Persons 

Predicted 

Losses 

Actual 

Losses 

B  , 

BXXX^ 

i 

10 

.0872 

1588 

138 

143 

+  3.  5% 

2 

11 

.  3793 

72 

27 

18 

-50. 0% 

3 

7 

.2615 

490 

130 

126 

-3.2% 

4 

4 

.  6458 

55 

36 

30 

-16.7% 

5 

8 

.  9087 

1296 

1178 

1145 

-2.  9% 

6 

9 

.  9641 

6111 

5892 

5852 

-.7% 

Total 

9620 

7401 

7314 

-l.  2% 

Predicted  Overall  Loss  Rate  =  .7693 
Actual  Overall  Loss  Rate  =  .7603 

♦Actual  -  Predicted 
Actual 
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TABLE  9 


COMPARISON  OF  ACTUAL  AND  PREDICTED  LOSSES  (OR  GAINS) 
ENLISTMENT  CATEGORY:  FT-RA  (FY73) 


TRANSACTION  CATEGORY:  All  losses 

AID  RUN  DATE:  15  November  1973 

AID  RUN  NUMBER:  L/P 

SAMPLE  SIZE  FOR  AID  RUN: 

Random  or  Not  Random 

Sample  Number;  Size;  No.  of  Obs.  Read  in:  Sample  #5,  10%  (32500) 

SAMPLE  SIZE  FOR  COMPUTING  ACTUAL  LOSSES  (OR  GAINS): 
Comparison  Run  Date:  30  November  1973 
Random  or  Not  Random 
Sample  Number  &  Size:  Sample  #5;  10% 

BSS/TSS:  .365 

MEAN:  .3139 


Terminal 

Cell 

Number 

Cell  Number 

Loss 

Probability 

Number 

of 

Persons 

Predicted 

Losses 

Actual 

Losses 

Percent* 

Error 

1 

14 

.0821 

8529 

700 

691 

-  1.3 

2 

6 

.1111 

21 

2 

5 

+60.0 

3 

16 

.  1521 

5886 

895 

931 

+  2.0 

4 

10 

.  2047 

5346 

1094 

1005 

-  8.9 

5 

18 

.  2835 

2484 

704 

758 

+  7.  1 

6 

9 

.  3084 

1870 

577 

629 

+  8.  3 

7 

13 

.  3099 

783 

243 

229 

-  6.  1 

8 

17 

.  3761 

288 

108 

86 

-25.  6 

9 

20 

.  5036 

671 

338 

323 

-  4.6 

10 

7 

.  8433 

6474 

5459 

5472 

+  .  2 

11 

21 

.  8643 

148 

128 

119 

-  7.6 

Total 

32500 

10248 

10230 

.2 

Predicted  Overall  Loss  Rate  =  .3153 
Actual  Overall  Loss  Rate  =  .3148 


-Actual  -  Predicted 
Actual 
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TABLE  10 


COMPARISON  OF  ACTUAL  AND  PREDICTED  LOSSES  (OR  GAINS) 
ENLISTMENT  CATEGORY:  CAREERISTS  (FY73) 


TRANSACTION  CATEGORY:  All  losses 
AID  RUN  DATE:  25  November  1973 
AI  D  RUN  NUMBER:  L/P 
SAMPLE  SIZE  FOR  AID  RUN: 

Random  or  Not  Random 

Sample  Number;  Size;  No,  of  Obs.  Read  in:  Sample  #5;  10%  (26,  528) 
SAMPLE  SIZE  FOR  COMPUTING  ACTUAL  LOSSES  (OR  GAINS): 

Comparison  Run  Date:  29  November  1973 

Random  or  Not  Random 

Sample  Number  &  Size:  Sample  #7;  10%  (26,528) 

BSS/TSS:  .338 
MEAN:  .  1864 


Terminal 

Cell 

Number 

Cell  Number 

Loss 

Probability 

Number 

of 

Persons 

Predicted 

Losses 

Actual 

Losses 

Percent* 

Error 

1 

16 

16002 

692 

694 

+  .  3 

2 

28 

.  1624 

1876 

305 

358 

+  14.  8 

3 

20 

.  1820 

1885 

343 

321 

-  6.8 

4 

10 

WPfflU 

850 

194 

171 

-13.  5 

5 

17 

.  3191 

176 

56 

54 

-  1.8 

6 

26 

.  3327 

922 

307 

333 

+  7.8 

7 

21 

,  4554 

433 

197 

184 

-  7.  1 

8 

22 

.  4846 

486 

236 

233 

-1.3 

9 

24 

503 

255 

243 

-  4.  9 

10 

27 

424 

220 

179 

-22.  9 

11 

29 

.  5733 

74 

42 

36 

-16.6 

12 

19 

.6159 

137 

84 

76 

-10.  5 

13 

25 

.  6765 

934 

632 

624 

-1.3 

14 

23 

.  6865 

827 

568 

571 

+  .  5 

15 

15 

.  8435 

754 

636 

658 

+  4.  9 

Total 

26283 

4767 

4730 

-  .  8 

I 


Predicted  Overall  Loss  Rate  =  .1814 
Actual  Overall  Loss  Rate  =  .  1800 

‘-Actual  -  Predicted 
- Actual -  100  43 


TARIFF  11 


COMPARISON  OF  ACTUAL  AND  PREDICTED  LOSSES  (OR  GAINS) 
USING  FY70  AID  TREES  TO  PREDICT  FY73  LOSSES 
ENLISTMENT  CATEGORY:  FT-AUS 

TRANSACTION  CATEGORY:  All  Losses 

AID  RUN  DATE:  July  26,  1973 

AID  RUN  NUMBER:  E-7  (L/P) 

SAMPLE  SIZE  FOR  AID  RUN: 

Random  or  Not  Random 

Sample  Number  &  Size:  No.  of  Obs.  Read  in  :  Sample  #1, 
10%  (47,  153) 

SAMPLE  SIZE  FOR  COMPUTING  ACTUAL  LOSSES  (OR  GAINS): 
Comparison  Run  Date:  December  5,  1973 
Random  or  Not  Random 
Sample  Number  A  Size  H  7;  (9,  800) 

RSS/TSS:  .631 

MEANS:  .5724 


Te  rni  inal 
Cell 

(\u  mbcr 

Cell 

N  uni  be  r 

Loss 

Probability 

Number 

of 

Pe  rsons 

Predicted 

Losses 

Actual 

Losses 

Percent 

E  rror 

1  , 

4 

.  0892 

1590 

142 

160 

111.  2 

2 

8 

.  2  485 

524 

130 

75 

73.  3 

» > 

.  i 

9 

.  5336 

159 

88 

43 

-104.  6 

4 

10 

.  7073 

614 

436 

524 

16.  8 

5 

7 

.  <1489 

6913 

6560 

6472 

1.  3 

TOTAL 

1  , 

9800 

7356 

7274 

p— • 

Predicted  Overall  Loss  Rate  =  .  7506 
Actual  Overall  Loss  Rate  =  .  7422 

Actual  -  Predicted  jqq 


Actual 

Note:  See  Figure  2  and  Table  4  for  AID  tree. 
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TABLE  12 


COMPARISON  OF  ACTUAL  AND  PREDICTED  LOSSES  (OR  GAINS) 
USING  FY70  AID  TRESS  TO  PREDICT  FY73  LOSSES 
ENLISTMENT  CATEGORY:  FT-RA 

TRANSACTION  CATEGORY:  All  Losses 

AID  RUN  DATE:  August  6,  1973 

AID  RUN  NUMBER:  E-4  (L/P) 

SAMPLE  SIZE  FOR  AID  RUN: 

Random  or  Not  Random 

Sample  Number  &  Size:  No.  of  Obs.  Read  in:  Sample  #1: 
55.  705  (10%) 

SAMPLE  SIZE  FOR  COMPUTING  ACTUAL  LOSSES  (OR  GAINS): 
Comparison  Run  Date:  December  5,  1973 
Random  or  Not  Random 

Sample  Number  &  Size:  Sample  #7;  32,  362  (10%) 

BSS/TSS:  .  416 
MEANS:  .3516 


Terminal 

Cell 

Number 

1  Cell 

t 

Number 

Loss 

I  Probability 

[ 

Number 
!  of 
|  Persons 

j  Predicted 
i  Losses 

i 

Actual 

Losses 

Percent 

Error 

1 

'  2 

.  1520 

j  25020 

3803 

3518 

-8.  1 

4 

.  6161 

|  4246 

2  616 

1 

3024 

13.  5 

3 

5 

.  8984 

3096 

2781 

f 

2442 

13.  9 

TOTAL 

32, 362 

9200  ! 

8984 

2.  4 

Predicted  Overall  Loss  Rate  =  .  2843 
Actual  Overall  Loss  Rate  =  .2776 

Actual  -  Predicted 
Actual 

Note  See  Figures  4  It  5  and  Tables  6  &  7  for  AID  Tree. 
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TABLE  13 


COMPARISON  OF  ACTUAL  AND  PREDICTED  LOSSES  (OR  GAINS) 

USING  FY70  AID  TREES  TO  PREDICT  FY  73  LOSSES 
ENLISTMENT  CATEGORY:  CAREERISTS  INELIGIBLE  FOR  RETIREMENT 

TRANSACTION  CATEGORY:  All  Losses 

AID  RUN  DATE:  August  6,  1973 

AID  RUN  NUMBER:  E-2  (L/P) 

Random  or  Not  Random 

Sample  Number  &  Size;  No.  of  Obs.  Read  in  Sample  #1;  23,027  (10%) 
SAMPLE  SIZE  FOR  COMPUTING  ACTUAL  LOSSES  (OR  GAINS): 


Comparison  Run  Date:  December  5,  1973 
Random  or  Not  Random 


BSS/TSS- 

MEANS: 

Sample  Number  &  Size 

.  413 
.  1604 

Sample 

#7;  22,  757  (10%) 

Terminal 

Cell 

Number 

j  Cell 
i  Numbet 

j  Loss  1 

•  Probability! 

I 

-  J 

Number 

of 

Persons 

Predictedj 

Losses 

Actual 

Losses 

Percent  '• 

Error 

1 

8 

.0476 

17908 

852 

806 

-5.  7 

2 

9 

.  3084 

231 

71  ! 

80 

11.  2 

3 

7 

.  3545 

62  8 

223 

251 

11.  2 

4 

12 

.  0847 

1651 

140 

161 

13  0 

5 

7 

.  3542 

387 

137 

137 

0.  0 

6 

10 

.  5817 

1155 

674 

689 

2.  2 

7 

11 

.  8560 

797 

682 

614 

-11.  1 

TOTAL 

22, 757 

2,779  2,738 

-1.  5 

Predicted  Overall 

Loss  Rate  =  . 

1221 

Actual  Overall  Loss  Rate  -  .  1203 

Actual  -  Predicted 
Actual 


Note:  See  Figure  3  and  Table  5  for  AID  Tree. 


Chapter  4 

RECOMMENDATIONS 


(a)  The  results  of  this  study  show  that  the  AID-E  is  the  best  available 
method  for  predicting  loss  rates:  its  margin  or  error  for  each 
enlistment  category  is  less  than  1.  5%  in  FY73  and  less  than  3.  5% 
in  FY70.  For  year-to-year  predictions,  the  AID  trees  developed 
from  FY70  data  were  used  to  predict  losses  in  FY73.  The  results 
show  that  the  prediction  errors  are  less  than  2.  4%.  For  this  and 
other  reasons  (among  them,  consistency  and  flexibility  of  the  system), 

O E-TEMPO  recommends  that  the  AID-E  be  designated  the  official 
method  for  supplying  loss  rates  to  all  the  Army's  manpower  prediction 
m odcls. 

(b)  The  AID-E  system  is  best  used  to  produce  annual  loss  rates. 

Croat  care  must  be  exercised  in  converting  its  annual  rates  into 
monthly  or  quarterly  rales.  In  order  to  preserve  consistency  in  the 
use  of  AID  loss  rates  in  all  manpower  models,  the  Army  cannot  be 

too  strongly  urged  to  designate  an  official  agency  in  charge  of  providing 
All)  loss  rates  for  monthly,  quarterly,  and  annual  manpower  prediction 
systems. 

(c)  The  AID-E  is  not  only  valuable  in  producing  loss  rates,  but  is 

also  an  excellent  system  to  provide  immediate  information  for  various 
Army  manpower  studies.  Any  data  elements  contained  in  both  the 
EMF  and  the-  Cain/ Loss  Transaction  Files  can  be  readily  summarized, 
analyzed,  and  tabulated:  for  example,  (1)  the  problem  of  AWOL  and 
MOS  imbalance /mismatch;  (2)  distribution  of  the  enlisted  population 
by  age,  sex,  geographic  origin,  marital  status,  performance  index, 
and  others.  CE-TEMPO  recommends  that  the  AID-E  be  designated 
as  a  key  system  to  be  used  in  manpower  policy  making  studies.  The 
system  is  capable  of  processing  large  quantities  of  data  to  produce 
accurate  information  in  a  timely  manner.  It  should  be  noted  that 
CE-TEMPO  has  already  installed  the  AID-E  in  an  Army  facility. 

(d)  The  IX'SPER-46  Report  (the  official  report  of  the  Army's  manpower’ 
strength  status)  can  be  improved  in  reporting  losses  in  a  significant 
respect.  It  reports  only  the  number  of  incidents  of  loss  from  various 
causes.  If  a  person  takes  five  DFMC's  (drop  from  military  control)  and 
subsequently  returns,  five  losses  arc  reported.  This  practice  is 
acceptable  if  the  Army  is  interested  simply  in  the  number  of  incidents, 
but  it  obscures  the  true  number  of  persons  lost.  CE-TEMPO 
recommends  that,  if  the  Army  continues  the  present  practice  of 
counting  incidents,  the  DCSPER-46  Report  be  modified  to:  (1)  clearly 
distinguish  permanent  losses  from  incidents  (temporary)  of  loss 

and  (2)  include  the  number  of  persons  versus  the  number  of  incidents 
for  loss  categories  such  as  DFMC  where  more  than  one  loss  for  each 
individual  can  occur.  The  information  for  both  number  of  incidents  of 
loss  and  number  of  persons  lost  is  of  value  depending  on  the  purpose. 
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Testing  of  AID-E  rates  by  GE-TEMPO  has  been  most  promising, 
but  the  evaluation  of  accuracy  and  validity  of  the  AID-E  loss 
rates  should  be  continued  by  the  Army.  Such  extensive  testing 
and  operational  use  of  the  AID-E  system  will  highlight  the 
strengths  and  shortcomings  of  rates  generated.  An  office 
should  be  identified  to  document  these  strengths  and  short¬ 
comings  for  user  guidance. 


APPENDIX 


Definitions  of  Variables  Used  in  Figures  2-13 

ETS  -  Number  of  months  remaining  prior  to  expiration  of  term  of 
Service 

TAS  -  Number  of  months  in  total  active  military  service 

Major  Command  -  The  command  or  agency  to  which  an  individual 
is  assigned. 

Code  Meaning 

01  First  US  Army 

02  Third  US  Army 

03  Fifth  US  Army 

04  Sixth  US  Army 

16  US  Air  Defense  Command 

14  The  Adjutant  General 

05  US  Army  Alaska 

15  Army  Security  Agency 

14  Army  Audit  Agency 

06  US  Army,  Southern  Command  (other  than  Antilles  Cmd.) 

06  US  Army,  Antilles  Command 

01  HQ's  US  Army  Continental  Army  Command 

13  US  Army  Strategic  Communications  Command 

13  US  Army  Combat  Development  Command 

14  Chief,  Engineers 

14  Office,  Chief  of  Staff,  Army 

07  HQ's  US  Army,  Europe 

07  US  Army  Theater  Army  Support  Command,  Europe 

07  SETA  F 

07  Berlin  Command 

07  Seventh  US  Army 

13  US  Army  Intelligence  Command 

12  US  Army  Material  Command 

12  US  Army  Electronics  Command 

12  US  Army  Missile  Command 

12  US  Army  Tank-Automoti ve  Command 

12  US  Army  Munitions  Command 

12  US  Army  Aviations  Systems  Command 

12  US  Army  Test  &  Evaluations  Command 

12  US  Army  Weapons  Command 

12  US  Army  Mobility  Equipment  Command 

13  The  Surgeon  General 

13  Military  Traffic  Management  Terminal  Service 
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Code  Meaning 


14  Military  District  of  Washington 

08  HQ's  US  Army,  Pacific 

08  US  Army,  Hawaii 

09  US  Army,  Japan 

09  US  Army,  Ryukyu  Islands 

10  HQ's  US  Army,  Vietnam 

10  HQ's  US  Army  Control,  Thailand 

11  US  Army  Forces,  Taiwan 

09  Eight  US  Army 

14  Office  of  the  Secretary  of  the  Army 

14  Dept,  of  Defense  &  US  Army  Elements  &  Joint  Activities 

14  Other  Field  Activities  of  the  Army  Staff 

14  Chief,  Support  Services 

00  Blank  or  Invalid 


Functional  Status  -  Status  of  active  personnel  by  state  of  readiness 

for  use  in  specific  roles,  missions,  and 
activities. 


Code 


Meaning 


1 

1 

1 

1 

1 

1 

1 

3 
2 

4 
1 

3 
1 

5 
2 
2 
1 
2 

4 
1 
2 
4 
9 


US  Army  Air  Defense  Command  (Admin,  elements) 
US  Army  Air  Defense  Command  (Oper.  elements) 
STRAF- readiness  capability  (REDCAPE) 


II 

U 


II 

II 


Continenlal  United  States  (operations) 

Patients 

En  Route 

Personnel  entering  military  service 
Special  foreign  activities 
Foreign  Patients 
Joint  Task  Force 

US  Army  Correctional  Holding  Detachment 

Personnel  in  process  of  separation 

Replacements 

Reimbursable  activities 

Returnees  for  reassignment 

Skills  Development  Base  student 

Overseas  Troops 

Transient 

Students 

Invalid  or  Blank 
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Grade  -  Pay  Grade 


Code  Meaning 


1 

2 

3 

4 

5 

6 

7 

8 
9 


E-l 

E-2 

E-3 

E-4 

E-5 

E-6 

E-7 

E-8 

E-9 


ETS-DEROS  -  Number  of  months  left  in  an  individual's  ETS  after 
returning  from  overseas  ETS. 


Educational  Level  -  The  highest  level  of  civilian  education  attained 

by  an  enlisted  man. 

Code  Meaning 

0  No  formal  education 

0  One  year  elementary  school  completed 

0  Two  years  elementary  school  completed 

0  Three  years  elementary  school  completed 

0  Four  years  elementary  school  completed 

0  Five  years  elementary  school  completed 

0  Six  years  elementary  school  completed 

0  Seven  years  elementary  school  completed 

0  Eight  years  elementary  school  completed 

1  One  year  High  School  completed  (9th) 

1  Two  Years  High  School  completed  (10th) 

2  Three  years  High  School  completed  (11th) 

2  Four  years  High  School  completed  (12th)  but  did  not 
graduate. 

3  High  School  graduate  (irrelevant  to  number  of  years 
completed) 

4  General  Education  Development  Level  (High  School) 

5  General  Education  Development  Level  (College) 

5  One  year  college  completed 

5  Two  Years  college  completed 

(also  graduate  of  2  yr.  college) 


Code  Meaning 

5  Three  years  college  completed 

5  Pour  years  college  completed  but  did  not  graduate 

6  College  graduate  {irrelevant  to  years  comp. ) 

7  Bachelor  of  Laws  (LLB) 

7  Doctor  of  Laws  (LLD) 

7  Juris  Doctor  (J.  D. ) 

7  Doctor  of  Juridical  Science  (JJ.S) 

7  Graduate  work  of  one  year  or  more  completed,  no 

degree 

7  Masters  Degree 

7  Doctorate  Degree 

7  Other  Professional  degree 

8  Blank  or  Invalid 


Prop- Pays  -  Proficiency  Pays  which  are  the  ratings  given  to 

enlisted  personnel  eligible  to  receive  pays  related 
to  his  skill  or  performance. 

Code  Meaning 

0  Withdrawal  of  Propay 

1  P-1,  the  lowest  level  of  skill-related  propay 

2  P-2,  the  second  lowest  level  of  skill-related  propay 

3  P-3,  the  highest  level  of  skill-related  propay 

1  P-4,  Superior  Performance  Pay 

0  U  nknown  Propay 

DEROS  -  Number  of  months  remaining  from  the  EMF  date  to 
the  date  on  which  an  individual  is  eligible  to  return 
to  CONUS  or  area  of  residence  from  overseas.  A 
person  not  in  overseas  has  zero  month  in  DEROS. 

Term  -  Term  of  service  in  terms  of  months. 

Race  -  Race  of  an  enlisted  personnel. 

Code  Meaning 

1  Caucasian 

2  Negro 

3  Other  &  Unknown 
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Sex  -  Sex  of  an  enlisted  personnel. 

Code  Meaning 

1  Male 

2  Female 

3  Unknown 

4  Error 

ECAT  -  Enlistment  category  which  assigns  an  enlisted  man 
into  one  of  the  following  codes: 

Code  Meaning 

1  First-term  Army  of  the  United  States  (draftees) 

2  First-term  Regular  Army 

3  Careerists  ineligible  for  retirement 

4  Careerists  eligible  for  retirement 

Age  -  Age  of  an  enlisted  man  in  terms  of  months. 

Residence  (RES)  -  State  or  geographical  area  of  residence 
at  the  time  of  entry  into  the  Army. 


FIRST  U.S.  ARMY 


STATE  CODE 


Connecticut  7 

Delaware  8 

Kentucky  1 

Maine  7 

Maryland  8 

Massachusetts  7 

New  Hampshire  7 

New  Jersey  2 

New  York  2 

Ohio  0 

Pennsylvania  2 

Rhode  Island  2 


53 


STATE 


CODE 


V  ermont  7 

Virginia  8 

West  Virginia  8 

THIRD  U.S.  ARMY 

STATE  CODE 


Alabama  1 

Florida  8 

Georgia  8 

Kentucky  1 

Mississippi  1 

North  Carolina  8 

South  Carolina  8 

Tennessee  1 

FIFTH  U.S.  ARMY 

STATE  CODE 


Arkansas  4 

Illinois  0 

Indiana  0 

Iowa  0 

Kansas  5 

Louisiana  4 

Michigan  0 

Minnesota  9 

Missouri  5 

Nebraska  5 

New  Mexico  3 

Oklahoma  4 

Texas  4 

Wisconsin  5 


54 


SIXTH  U.S.  ARMY 


STATE  CODE 


Arizona  3 

California  6 

Colorado  3 

Idaho  3 

Montana  3 

Nevada  6 

North  Dakota  9 

Oregon  3 

South  Dakota  5 

Utah  3 

Washington  3 

Wyoming  3 


U.S.  ARMY  MILITARY  DISTRICT  OF  WASHINGTON 


CODE 


District  of  Columbia  2 

Maryland  8 

V  irginia  8 

Alaska  9 

Hawaii  9 


